Search for: All records

Creators/Authors contains: "Shah, D."

« Prev Next »

Total Resources

5

Resource Type
Conference Paper

4

Conference Proceeding

0

Dataset

0

Journal Article

1

Workshop Report

0

Availability
Full Text / Resource Available

5

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Causal imputation via synthetic interventions”, Causal Learning and Reasoning

Squires, C. ; Shen, D. ; Shah, D. ; Uhler, C. ( January 2022 , Proceedings of Machine Learning Research)

Consider the problem of determining the effect of a compound on a specific cell type. To answer this question, researchers traditionally need to run an experiment applying the drug of interest to that cell type. This approach is not scalable: given a large number of different actions (compounds) and a large number of different contexts (cell types), it is infeasible to run an experiment for every action-context pair. In such cases, one would ideally like to predict the outcome for every pair while only needing outcome data for a small _subset_ of pairs. This task, which we label "causal imputation", is a generalization of the causal transportability problem. To address this challenge, we extend the recently introduced _synthetic interventions_ (SI) estimator to handle more general data sparsity patterns. We prove that, under a latent factor model, our estimator provides valid estimates for the causal imputation task. We motivate this model by establishing a connection to the linear structural causal model literature. Finally, we consider the prominent CMAP dataset in predicting the effects of compounds on gene expression across cell types. We find that our estimator outperforms standard baselines, thus confirming its utility in biological applications.
more » « less
Full Text Available
A Computationally Efficient Method for Learning Exponential Family Distributions

Shah, A ; Shah, D ; Wornell, G. W. ( December 2021 , Advances in neural information processing systems)

We consider the question of learning the natural parameters of a k parameter \textit{minimal} exponential family from i.i.d. samples in a computationally and statistically efficient manner. We focus on the setting where the support as well as the natural parameters are appropriately bounded. While the traditional maximum likelihood estimator for this class of exponential family is consistent, asymptotically normal, and asymptotically efficient, evaluating it is computationally hard. In this work, we propose a computationally efficient estimator that is consistent as well as asymptotically normal under mild conditions. We provide finite sample guarantees to achieve an l2 error of α in the parameter estimation with sample complexity O(poly(k/α)) and computational complexity O(poly(k/α)). To establish these results, we show that, at the population level, our method can be viewed as the maximum likelihood estimation of a re-parameterized distribution belonging to the same class of exponential family.
more » « less
Full Text Available
Causal imputation via synthetic interventions

Squires, C. ; Shen, D. ; Agarwal, A. ; Shah, D. ; Uhler, C. ( January 2022 , Causal Learning and Reasoning 2022)

Full Text Available
On Learning Continuous Pairwise Markov Random Fields

Shah, A ; Shah, D ; Wornell, G. W. ( April 2021 , Proc. Int. Conf. Artif. Intell., Stat. (AISTATS-2021), Proc. Mach. Learn. Res. (PMLR))
null (Ed.)
We consider learning a sparse pairwise Markov Random Field (MRF) with continuous valued variables from i.i.d samples. We adapt the algorithm of Vuffray et al. (2019) to this setting and provide finite- sample analysis revealing sample complexity scaling logarithmically with the number of variables, as in the discrete and Gaussian settings. Our approach is applicable to a large class of pairwise MRFs with continuous variables and also has desirable asymptotic properties, including consistency and normality under mild conditions. Further, we establish that the population version of the optimization criterion employed by Vuffray et al. (2019) can be interpreted as local maximum likelihood estimation (MLE). As part of our analysis, we introduce a robust variation of sparse linear regression à la Lasso, which may be of interest in its own right.
more » « less
Full Text Available
Preferences Implicit in the State of the World

R. Shah, D. Krasheninnikov ( January 2019 , International Conference on Learning Representations (ICLR))

Full Text Available